Feature extraction and clustering for the computer-aided reconstruction of strip-cut shredded documents

نویسندگان

  • Anna Ukovich
  • Giovanni Ramponi
چکیده

bstract. We propose a solution for the computer-aided recontruction of strip-cut shredded documents. First of all, the visual conent of the strips is automatically extracted and represented by a umber of numerical features. Usually, the pieces of different pages ave been mixed. A grouping of the strips belonging to a same page s thus realized by means of a clustering operator, to ease the sucessive matching performed by a human operator with the help of a omputer. © 2008 SPIE and IS&T. DOI: 10.1117/1.2898551

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reconstructing Shredded Documents

This project looks at the challenges involved in the automatic reconstruction of strip (vertically cut) and cross (both vertically and horizontally cut) shredded documents. The unshredding problem is of interest in the fields of forensics, investigative sciences, and archaeology. All stages of the unshredding pipeline are analysed, starting from scanned images of shreds and ending with reconstr...

متن کامل

An alternative clustering approach for reconstructing cross cut shredded text documents

In this paper, we propose a clustering approach for solving the problem of reconstructing cross-cut shredded documents. This problem is important in the field of forensic science. Unlike other clustering approaches which are applied as a preprocessing step before the actual reconstruction algorithms, our clustering approach is part of the reconstruction process itself. We define a new cost func...

متن کامل

A Memetic Algorithm for Reconstructing Cross-Cut Shredded Text Documents

The reconstruction of destroyed paper documents became of more interest during the last years. On the one hand it (often) occurs that documents are destroyed by mistake while on the other hand this type of application is relevant in the fields of forensics and archeology, e.g., for evidence or restoring ancient documents. Within this paper, we present a new approach for restoring cross-cut shre...

متن کامل

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • J. Electronic Imaging

دوره 17  شماره 

صفحات  -

تاریخ انتشار 2008